Applying artificial vision models to human scene understanding

Authors

  • Elissa Aminoff
  • Mariya Toneva
  • Abhinav Shrivastava
  • Xinlei Chen
  • Ishan Misra
  • Abhinav Gupta
  • Michael J. Tarr
Abstract

How do we understand the complex patterns of neural responses that underlie scene understanding? Studies of the network of brain regions held to be scene-selective, namely the parahippocampal/lingual region (PPA), the retrosplenial complex (RSC), and the occipital place area (TOS), have typically focused on single visual dimensions (e.g., size) rather than on the high-dimensional feature space in which scenes are likely to be neurally represented. Here we leverage well-specified artificial vision systems to develop a more complex understanding of how scenes are encoded in this functional network. We correlated similarity matrices within three different scene-spaces arising from: (1) BOLD activity in scene-selective brain regions; (2) behaviorally measured judgments of visually perceived scene similarity; and (3) several different computer vision models. These correlations revealed that: (1) models that relied on mid- and high-level scene attributes showed the highest correlations with the patterns of neural activity within the scene-selective network; (2) NEIL and SUN, the models that best accounted for the patterns obtained from PPA and TOS, were different from GIST, the model that best accounted for the pattern obtained from RSC; and (3) the best-performing models outperformed behaviorally measured judgments of scene similarity in accounting for neural data. One computer vision method, NEIL ("Never-Ending-Image-Learner"), which incorporates visual features learned as statistical regularities across web-scale numbers of scenes, showed significant correlations with neural activity in all three scene-selective regions and was one of the two models best able to account for variance in the PPA and TOS. We suggest that these results are a promising first step in explicating more fine-grained models of neural scene understanding, including developing a clearer picture of the division of labor among the components of the functional scene-selective brain network.
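The comparison of scene-spaces described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not say which similarity measure or correlation statistic was used, so Pearson correlation is assumed for both steps, and the function names and toy data below are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def similarity_matrix(patterns):
    """Scene-by-scene matrix: entry [i][j] is the Pearson correlation
    between the patterns for scene i and scene j (assumed measure)."""
    return [[pearson(p, q) for q in patterns] for p in patterns]


def upper_triangle(matrix):
    """Off-diagonal upper-triangle entries, flattened row by row."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def compare_spaces(patterns_a, patterns_b):
    """Correlate the similarity structure of two scene-spaces.

    Both inputs must list the same scenes in the same order; the
    vectors may have different lengths across the two spaces.
    """
    sim_a = upper_triangle(similarity_matrix(patterns_a))
    sim_b = upper_triangle(similarity_matrix(patterns_b))
    return pearson(sim_a, sim_b)


# Toy data (invented for illustration): 4 scenes per space.
# Rows stand in for voxel responses in one region of interest.
roi_patterns = [
    [1.0, 2.0, 3.0, 4.0],
    [1.1, 2.1, 2.9, 4.2],
    [4.0, 3.0, 1.0, 2.0],
    [3.9, 3.2, 1.1, 1.8],
]
# Rows stand in for feature vectors from one computer vision model.
model_features = [
    [0.9, 0.1, 0.5],
    [0.8, 0.2, 0.5],
    [0.1, 0.9, 0.4],
    [0.2, 0.8, 0.6],
]

r = compare_spaces(roi_patterns, model_features)
print(f"model-to-ROI similarity correlation: {r:.3f}")
```

Only the off-diagonal upper triangle is compared because each similarity matrix is symmetric with a constant diagonal, so the remaining entries carry no extra information. A rank-based statistic such as Spearman correlation could be substituted in `compare_spaces` if a monotonic rather than linear relationship is of interest.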

Related resources

Representation Models and Machine Learning Techniques for Scene Classification

Scene classification is a fundamental process of human vision that allows us to efficiently and rapidly analyze our surroundings. Humans are able to recognize complex visual scenes at a single glance, despite the number of objects with different poses, colors, shadows and textures that may be contained in the scenes. Understanding the robustness and rapidness of this human ability has been a fo...

Face-based Chromatic Adaptation for Tagged Photo Collections

Colour constancy is the phenomenon where the chromaticity of a surface material appears to remain constant to an observer under different illuminants. Studies have shown human vision exhibits colour constancy to varying degrees. Digital camera sensors completely lack this property, resulting in pictures with unsightly colour casts. This paper presents a method for semi-automatically removing co...

Occlusion cues for image scene layering

DOI: http://dx.doi.org/10.1016/j.cviu.2012.10.001. To bring computer vision closer to human vision, we attempt to enable computers to understand the occlusion relationships in an image. In this paper, we propose five low dimensional...

Advances in Algorithms for Inference and Learning in Complex Probability Models

Computer vision is currently one of the most exciting areas of artificial intelligence research, largely because it has recently become possible to record, store and process large amounts of visual data. Impressive results have been obtained by applying discriminative techniques in an ad hoc fashion to large amounts of data, e.g., using support vector machines for detecting face patterns in ima...

A Computational Model of Extrastriate Visual Area MT on Motion Perception

The human visual system is sensitive to motion in complex scenes. Building motion attention models similar to the human visual attention system should be very beneficial to computer vision and machine intelligence; however, this has been a challenging task due to the complexity of the human brain and the limited understanding of the mechanisms underlying the human visual system. This paper model...

Volume: 9

Publication date: 2015